Facial keypoints detection using capsule neural networks
Annotation
The problem of detecting key points of the face is investigated. This problem is quite relevant and important. The existing approaches of solving this problem, which are usually divided into parametric and nonparametric methods, are considered. As a result of the study, it was concluded that, nowadays, the most qualitative results are demonstrated by approaches based on deep learning methods. Two solutions are proposed: a capsule network with dynamic routing and a deep capsule network. The data for the experiments are 10,000 generated faces taken from Kaggle, marked up using MediaPipe. A method of using capsule architectures in neural networks to solve the problem of detecting key points of the face is proposed. The method includes the use of segmentation based on the key points of the face recognized using MediaPipe. Delaunay triangulation was used to build the face mesh. The architecture of a deep capsule network considering semantic segmentation was proposed. Based on the marked-up data, experiments on the detection of key points using the developed capsule neural networks were performed. According to the test results, the loss function reached values in range 2.50–2.90, the accuracy reached values in range 0.87–0.9. The proposed architecture can be used in technologies for comparing the geometry of the face grid of a real person with the geometry of the face grid of a three-dimensional model as well as in further studies of capsule neural networks by researchers in the field of image processing and analysis.
Keywords
Постоянный URL
Articles in current issue
- Analysis of frequency-robust multivariable dynamical systems
- Fractal micro- and nanodendrites of silver, copper and their compounds for photocatalytic water splitting
- Mathematical modelling of tri-layer dielectric OTFT based on pentacene semiconductor for enhancing the electrical characteristics
- Researching carbon dioxide hydrates in thin films via FTIR spectroscopyat temperatures of 11–180 K
- Method for increasing the information value of video data based on the removal of redundant frames and entropy estimation
- Attacker group detection method based on HTTP payload analysis
- Review of national and international standards for categorizing of critical information infrastructure objects
- Criterion of the network infrastructure security
- A novel approach to feature collection for anomaly detection in Kubernetes environment and agent for metrics collection from Kubernetes nodes
- Time parameters linear approximation method in elastic systems
- Role discovery in node-attributed public transportation networks: the study of Saint Petersburg city open data
- Exploring the possibility of predicting users’ career guidance preferences based on analysis of community topics and the gender in the online social network users’ profiles
- Blindness detection in diabetic retinopathy using Bayesian variant-based connected component algorithm in Keras and TensorFlow
- Joint recognition of text and layout in historical Russian documents
- Intelligent clinical decision support for small patient datasets
- Assessment of the readiness of a computer system for timely servicing of requests when combined with information recovery of memory after failures
- Buckling analysis of an orthotropic cylindrical shell structure in the ANSYS Mechanical APDL software package
- Justification of the choice of mobile broadband access technology for building radio communication networks of railway transport
- Comparative performance analysis of DVR & DSTATCOM for distributed generation with gravitational search algorithm
- Estimation of the moments of a quantized random variable
- Experimental method for estimating the dynamic error of devices and sensors under their operating conditions
- Method of type-C liquified natural gas tank modeling based on volume optimization for future “milk-run” exploitation
- Optical properties of borate family nonlinear crystals and their application in sources of intense terahertz radiation
- A model of a refractive fiber optic sensor sensing element based on MMF-SMF-MMF structure using surface plasmon resonance